Voice activity detection algorithm based on long-term pitch information

Authors

  • Xu-Kui Yang
  • Liang He
  • Dan Qu
  • Wei-Qiang Zhang
Abstract

A new voice activity detection algorithm based on long-term pitch divergence is presented. The long-term pitch divergence not only decomposes speech signals with a bionic decomposition but also makes full use of long-term information. It is more discriminative than other feature sets, such as long-term spectral divergence. Experimental results show that, among the six algorithms analyzed, the proposed algorithm performs best, with the highest non-speech hit rate and a reasonably high speech hit rate.
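The two evaluation figures named in the abstract can be illustrated with a minimal sketch. It assumes the definitions commonly used in the voice activity detection literature: the speech hit rate is the fraction of reference speech frames labeled as speech, and the non-speech hit rate is the fraction of reference non-speech frames labeled as non-speech. The function name `hit_rates` and the 1/0 frame labeling are hypothetical choices made for illustration; the abstract does not state how the paper computes these rates.

```python
import math
from typing import Sequence, Tuple


def hit_rates(reference: Sequence[int], decision: Sequence[int]) -> Tuple[float, float]:
    """Return (speech_hit_rate, non_speech_hit_rate) for frame-level labels.

    Assumed labeling (not taken from the paper): 1 = speech, 0 = non-speech.
    A rate is NaN when the reference contains no frame of that class.
    """
    if len(reference) != len(decision):
        raise ValueError("reference and decision must have the same length")

    # Count reference frames of each class.
    speech_total = sum(1 for r in reference if r == 1)
    non_speech_total = sum(1 for r in reference if r == 0)

    # Count frames where the detector agrees with the reference.
    speech_hits = sum(1 for r, d in zip(reference, decision) if r == 1 and d == 1)
    non_speech_hits = sum(1 for r, d in zip(reference, decision) if r == 0 and d == 0)

    speech_rate = speech_hits / speech_total if speech_total else math.nan
    non_speech_rate = non_speech_hits / non_speech_total if non_speech_total else math.nan
    return speech_rate, non_speech_rate


# Toy example: 4 speech frames followed by 2 non-speech frames.
reference = [1, 1, 1, 1, 0, 0]
decision = [1, 1, 1, 0, 0, 1]
speech_rate, non_speech_rate = hit_rates(reference, decision)
```

In this toy example three of the four speech frames and one of the two non-speech frames are detected correctly. A detector can raise one rate at the expense of the other by shifting its decision threshold, which is why the two figures are reported together.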

Similar resources

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signal may be classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

Pitch Estimation of Singing Voice From Monaural Popular Music Recordings

Singing voice separation is a hard yet popular task in the field of music information retrieval (MIR). Once the voice is successfully separated, a number of algorithms can be applied to the vocal melody for further applications. In this study, we applied a pitch estimation algorithm after separating a singing voice from background music based on the implementation of REPET [1]. Then we evaluated our ...

A Novel Voice Sensor for the Detection of Speech Signals

In order to develop a novel voice sensor to detect human voices, the use of features that are more robust to noise is an important issue. Voice sensing is also called voice activity detection (VAD). Because the formant structure appears only in the speech spectrogram (well known as a voiceprint), Wu et al. were the first to use band-spectral entropy (BSE) to describe t...

Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra

Detection of voiced speech and estimation of the pitch frequency are important tasks for many speech processing algorithms. Pitch information can be used, e.g., to reconstruct voiced speech corrupted by noise. In automotive environments, driving noise especially affects voiced speech portions in the lower frequencies. Pitch estimation is therefore important, e.g., for in-car-communication syste...

A wavelet- and neural network-based voice interface system for wheelchair control

Voice control has long been considered a natural mechanism to assist powered wheelchair users. However, one implementation difficulty is that a voice input system may fail to recognise a user’s voice. Indeed, a speech-activated interface between humans and autonomous/semi-autonomous systems requires accurate detection and recognition. In this area pitch and end-point detection is of vital impor...

Journal:
  • EURASIP J. Audio, Speech and Music Processing

Volume 2016  Issue 

Pages  -

Publication date: 2016